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A METHOD TO CLONE mRNAs AND DISPLAY OF DIFFERENTIALLY EXPRESSED TRANSCRIPTS (DODET) 
BACKGROUND OF THE INVENTION 

The human body is comprised primarily of specialised cells 
performing different physiological functions organised into 
5 organs and tissues. All human cells contain DNA, arranged in 
a series of sub-units known as genes. It is estimated that 
there are approximately 100,000 genes in the human genome. 
Genes are the blueprints for proteins. Proteins may perform a 

* 

wide variety of biological functions, for example messengers, 

10 catalysts and sensors. Such compounds are responsible for 

managing most of the physiological and biochemical functions 
in humans and all other living organisms. Over the last few 
decades, there has been a growing recognition that many major 
diseases have a genetic basis. It is now well established 

15 that genes play an important role in cancer, cardiovascular 
diseases, psychiatric disorders, obesity, and metabolic dis- 
eases. Significant resources are being focused on genomic 
research based on the notion that the nucleotide sequences of 
a particular gene and its predicted protein product will lead 

20 to an understanding of its function in healthy and malfunc- 
tioning cells or tissues. This understanding is expected, in 
turn, to lead to therapeutic and diagnostic approaches, 
focused on molecular targets associated with the gene and the 
protein it expresses. The first step on the way to the deve- 

25 lopment of such applications is to identify the genes speci- 
fically involved in the different categories of diseases. 
Application of this knowledge can produce new and valuable 
markers, identifying regions producing major diseases to be 
used for diagnostic and therapeutic benefit. 

3 0 Faced with the high complexity of the human genome, many 

approaches are being used to unravel the connection between 
primary gene structure and function. One well publicised 
approach is embodied in the Human Genome Mapping Project, 
where the sequence of all the individual genes in the entire 

35 human genome is painstakingly being determined. At the pre- 
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sent, however, little information can be directly retrieved 
on the function of the identified genes and still less about 
temporal and spatial expression patterns of the developing or 
mature organism. Other approaches, such as random cDNA se- 
5 quencing, involve the sequence determination of all genes 

expressed in a certain tissue, or developmental stage, of an 
organism. Like a number of other strategies, this is time 
consuming and prone to numerous problems . 

Although the flood of data from large scale sequencing pro- 
10 grammes is of enormous benefit to the scientific community, 
one of the major problems faced by such "shotgun" approaches 
is the lack of specific information that can be retrieved 
without significantly more work on the biology of each of the 
individual genes. 



15 Several other approaches have been taken by molecular biolo- 
gists to obtain more specific information on the genetic 
background of particular biological processes. Such approa- 
ches rely on a common concept. One gene, or a subset of 
genes, is switched on, initiating the healthy, pathological, 

20 or developmental status of an organ or cell type. 

In a large number of experimental systems the isolation of 
genes, on the basis of their differential expression, has 
been applied successfully. Differential screening and sub- 
tractive hybridisation of cDNA libraries have become well 

25 established, cf. Zimmerman et al . (1980) and Davis et al . 

(1979) . Differential library screening works well in practice 
for genes that are highly expressed, but mRNAs of low abun- 
dance are difficult to isolate. Subtractive hybridisation 
provides a more sensitive screening, but requires large 

3 0 amounts of RNA. More recently RNA fingerprinting methods 

(often referred to as differential display or DD/RT PCR) have 
been added to these tools, offering attractive new features 
for isolating genes. RNA fingerprinting methods are PCR based 
and therefore do not require large amounts of RNA for expe- 

35 riments. In addition to this, RNA fingerprinting methods 

allow a large number of RNA pools to be screened for specific 



WO 98/51789 PCT/DK98/00186 

3 

mRNAs simultaneously. Investigation of a wide range of patho- 
genic developmental stages and their controls would be pos- 
sible. To date, two methods of RNA fingerprinting have proven 
useful for isolating genes. In 1992 Liang et al- published a 
5 protocol (US Patent 5,262,311), soon after a protocol from 
Welsh et al . (1992) was presented. Both methods begin with 
cDNA synthesis from RNA using at least one arbitrary primer 
for the initiation of first and second strand synthesis. 

Welsh et al . (1992) designed a protocol in which the same 
10 arbitrary 20-mer oligo is used for first and second strand 

synthesis. Using arbitrary primers only a subset of the mRNAs 
are transcribed to cDNA. The cDNA pools are then used for a 
standard PCR with the same primers. One of the dNTPs in the 
PCR mix contains a radioactive label ( 35 S or 32 P) for visua- 
15 lisation of the PCR fragments with PAGE . The Liang and Welsh 
methods rely on at least one small arbitrary primer for 
selection of specific cDNAs. As a consequence annealing 
temperatures are low (~40°C) , and all amplified cDNA frag- 
ments originate from a certain degree of mismatch priming. 
20 Later several groups produced refinements and optimisations 
leading to a plethora of articles describing the usefulness 
of the method (Bauer and Warthoe et al. 1993; Warthoe et al. 
1995; Liang and Warthoe et al . 1995; Rohde and Warthoe et al . 
1996) . 

25 OBJECT OF THE INVENTION 

It is an object of the present invention to provide new 
methods and means for investigating the expression patterns 
in cells, especially in eukaryotic cells. The results of such 
investigations may be used in drug development, gene discove- 
30 ry, diagnosis of diseases etc., and therefore such improved 
methods are highly desirable. 
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SUMMARY OF THE INVENTION 

In its broadest scope, the invention pertains to a method for 
preparing a sub -divided library of amplified cDNA fragments 

from the coding region of mRNA contained in a sample, the 
method comprising the steps of 

a) subjecting the mRNA derived from the sample to reverse 
transcription using at least one cDNA primer having the 
general formula 

5 s - Coil! -dT n2 ^3-^4-3' 

10 wherein Con x is any sequence between 1-100 nucleotides, 

dT is deoxythymidinyl , V is A, G or C, N is A, G, C or T, 
n2 is an integer s 1, n3 is 0 or 1, if n3 is 0 then n4 is 
0, and if n3 is 1 n4 is an integer 2= 0, thereby obtaining 
first strand cDNA fragments, 

15 b) synthesizing second strand cDNA complementary to the 

first strand cDNA fragments by use of the first strand 
DNA fragments as templates, and a second cDNA primer with 
the general formula 

5 ' -Con 2 -N x _3 ' 

wherein Con 2 is any sequence between 1-100 nucleotides 
and can be different or identical to con x , N x is A, G, T 
or C, and x is an integer a 0, in a appropriate enzyme/ - 
buffer solution which comprises the DNA pol I enzyme or 
the Klenow fragment of the DNA pol I enzyme, all four 
deoxyribonucleoside triphosphates and standard buffer and 
temperature conditions, thereby obtaining double stranded 
cDNA fragments, 



20 



25 



c) subjecting the cDNA fragments obtained in step b) to a 
molecular amplification procedure so as to obtain ampli- 



i 
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fied cDNA fragments, wherein is used a set of amplifica- 
tion primers having the general formula 

5' -Con 3 -N nl -3' I 

wherein Con 3 is a sequence identical to either Con z or 
5 Con 2 or both, N is A, G , T or C, and nl is an integer ^ 

0, wherein at least one set of primers has the general 
formula I where n > 0, said at least one set being ca- 
pable of priming amplification of any nucleotide sequence 
complementary in its 5' -end to Con x or Con 2 . 

10 This method is advantageous for amplifying very small amounts 
of RNA. Using the method of the invention it is possible to 
perform gene -profile analysis from less than 100 cells equal 
to 10 " 9 gram total RNA (10 pgram RNA per cell) . 

In a further aspect, the invention relates to a method for 
15 preparing a sub-divided library of amplified cDNA fragments 
from the coding region of mRNA (which may be of prokaryotic, 
Archae or eukaryotic origin) contained in a sample, the 
method comprising the steps of 

a) subjecting the mRNA derived from the sample to rever- 
20 se transcription using at least one cDNA primer, thereby 

obtaining first strand cDNA fragments, 

b) synthesizing second strand cDNA complementary to the 
first strand cDNA fragments by use of the first strand 
DNA fragments as templates, thereby obtaining double 

25 stranded cDNA fragments, 



c) digesting the double stranded cDNA fragments with at 
least one restriction endonuclease, thereby obtaining 
cleaved cDNA fragments, 
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d) ligating at least two adapter fragments to the clea- 
ved cDNA fragments obtained in step c) , so as to obtain 
ligated cDNA fragments, and 

e) subjecting the ligated cDNA fragments obtained in 

5 step d) to a molecular amplification procedure so as to 

obtain amplified cDNA fragments, wherein is used, for an 
adapter fragment used in step d) , a set of amplification 
primers having the general formula 

5' -Com-N nl -3' . II 

10 wherein Com is a sequence complementary to at least the 

5' -end of an adapter fragment which is ligated to the 3'- 
end of a cleaved cDNA fragment, N is A, G, T, or C, and 
nl is an integer s> 0, and wherein at least one set of 
primers has the general formula II where nl > 0, said at 

15 least one set being capable of priming amplification of 

any nucleotide sequence ligated in its 3' -end to the 
adapter fragment complementary in its 5' -end to Com. 

The overall advantage of the invention compared to the prior 
art is that the resulting library of cDNA fragments contains 

20 nucleic acid sequences from all parts of cDNA which is pro- 
duced in step a). Prior art techniques which i.a. rely on 
poly-dT cDNA priming have a tendency to only yield fragments 
derived from the long untranslated regions of mRNA. Further- 
more, by fine-tuning of the conditions in each step, the 

25 method of the present invention results in highly specific 
reproduction of sequence information which is present in 
mRNA, even in mRNA which is only present in relatively low 
amounts. Furthermore, by choosing the optimum composition of 
endonuclease (s) it is possible to obtain cDNA fragments which 

30 are derived from a very large percentage of the total number 
of transcribed genes in relevant cells. 

The present method allows the targeted visualisation of known 
genes by using primer combinations, corresponding to sequen- 
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ces from the gene of interest. This has the advantage that 
all steps of the procedure and the biological system can 
easily be verified. Also, very specific expression analyses 
can be carried out on related genes with very high homology 
5 which could not be achieved by using hybridisation technolo- 
gy. 

Briefly, further steps in the method of the invention involve 
isolation of bands of interest from a gel, their cloning and 
sequencing. The sequence information allows re-amplification 
10 of individual bands, using primers with the appropriate 3-4 
nucleotide extensions. When run on a gel, these reactions 
will show one, or only a few, bands per lane, giving an 
unequivocal determination of band identity. 

Since the present technology makes use of end labelled pri- 
15 mers for visualization, the technology can be used, both with 
standard technologies involving radioactivity, or with fluo- 
rescent labelled primers, without the need for further opti- 
misation . 

The invention also pertains to methods for detecting diffe- 
20 rences between expression level (s) in cells which have been 
subjected to different conditions, methods for diagnosing 
disease, and methods related to "bioinf ormatics" wherein are 
used a combination of output from the above -disclosed method 
and data obtained by computer- simulation of corresponding 
25 treatment of well-defined stretches of nucleic acids. 

A separate part of the invention pertains to a novel method 
for performing reverse transcription, methods which yield 
considerably enhanced quality in the reversely transcribed 
material. Also means for carrying out this separate part of 
30 the invention are disclosed. 

In the following is given a short discussion of terms used in 
the present application: 
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"A sub-divided library of amplified cDNA fragments" is in the 
present context a library of amplified cDNA fragments which 
is split into a number of separate pools, each pool being 
defined by the sequences of the termini of the amplified 
5 fragments. For example, one pool may contain amplified frag- 
ments which are all characterized by having the sequence 5'- 
Com-AGC- in one of the strands, whereas another pool contains 
amplified fragments having the sequence 5'~Com~AAT in one of 
the strands. For a discussion of the meaning of "Con" and 
10 "Com", cf. below. 

"A normalised library" is a library containing substantially 
equal representation of each mRNA, i.e. approximately the 
same number of copies of each mRNA. 

"Reverse transcription" has its usual meaning in the art, 
15 i.e. synthesis of DNA using RNA as a template and effected by 
an enzyme having reverse transcriptase activity. 

"Adapter fragment" is intended to mean a nucleic acid sequen- 
ce containing a known sequence which can be used as template 
for a primer in a subsequent molecular amplification proce- 

2 0 dure such as PCR. The adapter fragment is further characte- 
rized by its ability to become integrated at the end of a 
cDNA fragment which has previously been cleaved with a re- 
striction endonuclease in step c) . In most cases, the re- 
striction endonuclease leaves fragments having "sticky ends", 

25 to which the adapter fragment will anneal readily, and there- 
after the adapter fragment becomes ligated to the cDNA by the 
action of a DNA ligase. 

DETAILED DISCLOSURE OF THE INVENTION 

In the following, the impact of each of the steps will be 
30 discussed in detail, see Figures 1 and 7. 
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The goal in step a) is to produce a mixture of first strand 
cDNA fragments which is optimized in its composition for 
carrying out the subsequent steps. A number of considerations 
5 apply: 

First of all, to reduce the "background noise", it is prefer- 
red that the annealing of cDNA primer to the RNA in step a) 
is performed under high stringency conditions, thereby en- 
suring that a minimum of mismatches are introduced in the 
10 cDNA relative to the mRNA, i.e. at a temperature above 50 °C. 

Secondly, it is desirable to obtain copies of sequences which 
are derived from all parts of mRNA in order to obtain infor- 
mation relating to the translated part of the mRNA. Prior art 
methods for reverse transcription of eukaryotic material have 
15 often utilised poly-dT as cDNA primers. This strategy has, 
however, the disadvantage that the most efficiently reverse 
transcribed material is situated in the untranslated part of 
the genes of interest. Hence, the only parts of the mRNA 
which become "visible" after e.g. a PCR procedure will very 

2 0 often be derived from untranslated regions of the RNA. The 

reason for this is two effects. First of all, the poly-dT 
approach has the consequence that the initiation point of 
reverse transcription is situated very far from e.g. the 
start codon relating to the operon in question. Secondly, the 
25 mRNA may include structures (e.g. "hairpin" structures due to 
intra- chain base-pairing) which block reverse transcription 
and by always initiating reverse transcription at one termi- 
nus of a gene, such structures will statistically block 
reverse transcription of a number of translated regions. 

3 0 It is in the present invention preferred to ensure that cDNAs 

are produced in step a) which are representatives of the 
entire gene, including the translated regions. 
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This can be obtained in a number of ways. If poly-dT priming 
(or a variation thereof) is used, it is preferred to perform 
the reverse transcription at an elevated temperature, e.g. in 
the range from about 45 °C to about 95 °C, and to use an enzyme 
5 having reverse transcriptase activity at said temperature. 
Normally the temperature will be higher than 45°C, e.g. at 
least 50°C, or even higher, e.g. at least 55°C, at least 
60°C, at least 65°C or even higher, e.g. at least 70°C. This 
approach has the effect that the elevated temperature ensures 
10 that e.g. hairpin structures are "stretched out" during the 
reverse transcription step, thereby avoiding the lack of 
reversely transcribed fragments upstream of such structures. 

Known enzymes having reverse transcriptase activity at such 
elevated temperatures are enzymes selected from the group 
consisting of DNA polymerases derived from thermophilic 
eubacteria, such as the polymerases Taq (Thermus aquati-cus) , 
Stof fel (Thermus aquaticus) , Tht (Thermus thermophilus) , 
Tf 1/Tub (Thermus flavus) , Tru (Thermus Ruber) , Tea (Thermus 
caldophilus) , Tfil (Thermus filiformis) , Tbr (Thermus Brocki- 
anus) , Bst (B. Stearothermophilus) , Bca (B. Caldotenax YT-G) , 
Bcav (B. Galdovelox YT-F) , FjSS3-B.l (Thermotoga FjSS3-B.l), 
Tma (Thermus Maritima) , UITma (T. Maritima) , Tli (T. Litora- 
lis) , Tli exo- (T. Litoralis) , 9°N-7 (Thermococcus sp.), BG-D 
(Pyrococcus sp.), Pfu (P. furiosus) , Pwo (P. woesei) , Sac (S. 
Acidocaldarius) , Ssol (S. Solf ataricus) , Tac (T. Acidophi- 
lum) , and Mth (Methananococcus Voltae) . 

One minor disadvantage of using these thermostable enzymes is 
that they have a tendency to be relatively ineffective com- 
pared to the "traditional" non- thermostable, reverse tran- 
3 0 scriptases. Hence, especially if priming of the reverse- 
transcription is not limited to the use of poly-dT primers, 
it is according to the invention possible to use non- thermo- 
stable, reverse transcriptases. Hence, in other preferred 
embodiments, the reverse transcription is carried out at a 
35 temperature in the range from about 25 °C to about 55°C by use 
of an enzyme having reverse transcriptase activity at said 
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temperature. Normally the temperature will not exceed 50°C, 
and usually it will be lower, such as at most 47°C / at most 
45°C, at most 43°C, at most 40°C / and at most 35°C. The 
reverse transcriptase can e.g. be selected from the group 
5 consisting of the reverse transcriptases from AMV (Avian 
Myeloblastosis Virus) , M-MuLV (murine M-MuLV pol gene) , and 
HIV-1 (HIV virus) . 

According to the invention, the most preferred way of carry- 
ing out step a) is to carry out reverse transcription in two 

10 subsequent steps, the first step comprising carrying out 

reverse transcription at the temperature conditions described 
above for non- thermostable enzymes, and the second step 
comprising carrying out reverse transcription at the tempera- 
ture conditions described above for thermostable enzymes. 

15 Normally this can be accomplished by having two non- identical 
enzymes present in the reverse transcription reaction, espe- 
cially because the non- thermostable enzyme will be inactiva- 
ted by the increase in temperature which is introduced when 
going into step 2* Of course, the enzymes can be added for 

20 each reaction step, but it is preferred that both enzymes are 
present from the start of the reaction. 

It is especially preferred that the activity of the enzyme 
which is active in the first step is substantially abolished 
in the second step (e.g. as a consequence of temperature 

25 denaturing of that enzyme) , or expressed otherwise, that in 
the second step the enzyme used in the first step is substan- 
tially inactive. In general, it is preferred that the enzymes 
used in each step are substantially more active in the rele- 
vant temperature range than the one wherein the other enzyme 

30 is used. 

In a preferred embodiment the reaction mixture with the 
sample comprises a cDNA primer, said cDNA primer being suffi- 
ciently complementary to the target RNA present in the sample 
to hybridize therewith and initiate synthesis of a single 
35 stranded cDNA molecule complementary to said target RNA and 
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the reaction mixture comprises an appropriate buffer which 
comprises all four deoxyribonucleoside triphosphates and a 
divalent cation selected from the group of Mg +2 and Mn 2+ in a 
concentration between 0.1 and 5 mM. 



5 In fact, it is believed that the above strategy for conduct- 
ing reverse transcription by use of two enzymes having diffe- 
rent temperature optima and of which one has a temperature 
optimum at which impeding structures in the RNA are "stretch- 
ed out", is novel and inventive in its own right. 

10 Preferred combinations of enzymes in this embodiment of the 

invention are that the enzyme effecting reverse transcription 
in the first step is MMuLV, AMV, HIV-1 and/or the enzyme 
effecting reverse transcription in the second step is Tth or 
Taq. 

15 An object of the method of the invention is to obtain a 

subdivision of the cDNA produced. When the mRNA is derived 
from a eukaryotic system, the at least one cDNA primer may 
include an oligo or poly dT tail in the 5' -end, having the 
general formula 5 ' -dT n2 -V n3 -N n4 - 3 ' , wherein dT is deoxythymi- 

20 dinyl, V is A, G, or C, N is A, G, C, or T, n2 is an integer 
a 1, n3 is 0 or 1, if n3 is 0 then n4 is 0 , and if n3 is 1 
then n4 is an integer s 0 . It will be clear that when n3 and 
n4 are both zero, then the primer is an ordinary poly- or 
oligo-dT cDNA primer. However, when n3 is 1, then the primer 

25 is in fact a primer composition which will be able to prime 
the reverse transcription of any mRNA having a poly-A tail. 
If the original sample of RNA is subdivided, and each sub- 
pool is subjected to reverse transcription which uses one of 
the possible primers having the above formula where n3 is 1, 

30 then the result is a number of single stranded cDNA pools 
which are each different from each other in the 5' -end. 

For example, when n3 is 1, 3 x 4 n4 groups of cDNA primers are 
used, each group being distinct from any one of the other 
groups with respect to the structure -V n3 -N n4 -. In such an 
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embodiment the pool of mRNA is conveniently subdivided into 
3 x 4 n4 aliquots which are each subjected separately to step 
a) utilising one of the 3 x 4 n4 groups of cDNA primers, ' 
thereby obtaining a subdivision of the first strand cDNA into 
5 3 x 4 n4 separate pools. Normally n4 will be 0 or 1, resulting 
in the provision of 3 or 12 pools, respectively. 

When the starting material is not eukaryotic or when it is 
not the intention to necessarily set out from the part of the 
transcribed gene which is most remote relative to the trans - 

10 lation start codon, the at least one cDNA primer does not 

include a poly or oligo dT tail in the 5' -end, or, alternati- 
vely, at least two cDNA primers are used of which at least 
one includes a poly or oligo dT tail in the 5' -end and of 
which at least one second does not include a poly or oligo dT 

15 tail in the 5' -end. Preferably, the cDNA primer which does 
not include a poly or oligo dT tail in the 5 ' end has the 
following structure 

5'-N x TTA-3' or 5'-N x CTA-3' or 5'-N x TCA-3', 

20 wherein N is A, G, T, or C f and x is an integer 1 <z x <, 20. 
It will be clear that this corresponds to cDNA priming set- 
ting out from any translation stop codon. As for the above 
embodiments utilising a poly- or oligo-dT tailed primer, it 
is, by preparing primers having all possible permutations 

25 represented in the group N x , possible to compose the primers 
so as to correspond to any possible sequence preceding a stop 
codon, thereby ensuring priming of all sequences having a 
stop codon in their sequence. 

Step b) 

3 0 This step is carried out by methods well known in the art. It 
is, however, preferred that step b) is carried out under con- 
ditions which minimize the formation of mismatches between 
nucleotides in the first and second cDNA strands. The double 
stranded cDNA procedure can be performed according to stan- 
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dard methods as described in Sambrook et a.1, (1989). However 
since standard polymerases can have difficulty in synthesi- 
sing regions containing secondary structures or with high GC- 
content, thermostable RNase H (Hybridase Thermostable RNase 
5 H, US 5,268,289) and thermostable rBst DNA polymerase from 
Bacillus stearothermophilus help overcome some of the limita- 
tions that standard polymerases (low temperature polymerases) 
suffer from. 

Step c) 

10 In one embodiment of the invention the ligated cDNA fragments 
obtained in step b) are subjected to a molecular amplifica- 
tion procedure so as to obtain amplified cDNA fragments, 
wherein is used a set of amplification primers having the 
general formula 



15 



5' - Con 3 -N nl -3 f 



20 



wherein Con 3 is a sequence identical to either Con x or 
Con 2 or both, N is A, G, T or C, and nl is an integer a 
0, wherein at least one set of primers has the general 
formula I where n > 0, said at least one set being ca- 
pable of priming amplification of any nucleotide sequence 
complementary in its 5' -end to Con-L or Con 2 . 



In another embodiment, after the preparation and optional 
subdivision of the mRNA, each of the different pools of cDNA 
is digested with at least one restriction enzyme to produce 
25 fragments of a size which can be separated using an appropri- 
ate size fractionation method. 



30 



The choice of restriction enzyme is based largely on the 
frequency of the cleavage sites in a given cDNA pool. Too 
many cleavage sites in each cDNA fragment will result in too 
small fragments, and vice versa. Optimally, the at least one 
enzyme should cleave every cDNA to yield fragments of the 
desired size. Statistically, it is not possible to cleave 
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every cDNA, but on the other hand a very large percentage can 
be cleaved by choosing a suitable enzyme or combination of 
enzymes. It is preferred that the method of the invention 
utilises at least one restriction enzyme chosen so as to 
5 ensure that at least 60% of cDNAs are cleaved, but higher 

percentages such as at least 65%, at least 70%, at least 75% , 
at least 80%, or even at least 85% are more preferred. 

Preferably the invention should use restriction enzymes that 
leave protruding ends (sticky ends) at the termini of the DNA 
10 after digestion in step c) , since this greatly facilitates 
the introduction of the adapter fragments in step d) . 

As will appear from the above, the frequency with which the 
restriction endonuclease cleaves is important. The at least 
one restriction enzyme is preferably chosen so as to cleave 
each complete cDNA into an average of about 3 fragments. It 
will be understood that some cDNAs obtained from preceding 
steps will not be cut at all (although this is a rare inci- 
dence when the restriction enzyme (s) is/are carefully chosen) 
whereas others will be cut with a high frequency. It has come 
out that use of a rare 4 base cutter as at the least one 
restriction endonuclease (such as the 4 base cutter Acil, 
Alul, Bfal, BstUI, Csp6I, Dpnl, Dpnll, Haelll, Hhal, HinPlI, 
Hpall, Mbol, Mill I, Msel, Mspl , Nlalll, Rsal, Sau3AI , Tail, 
TaqI, and Tsp509I) ensures the optimum performance of the 
inventive method. By use of such a rare 4 base cutter, the 
use of only 1 restriction enzyme in step c) is sufficient and 
results in superior output . 

Alternatively, a combination of restriction endonucleases can 
be used wherein a balance of e.g. 6 base cutters and 4 base 
30 cutters ensures a reasonable distribution of fragment sizes. 
For instance the use of a first restriction enzyme (e.g. a 6 
base cutter) which statistically cleaves at least 20% of 
complete cDNA derived from the mRNA sample into two subfrag- 
ments, and of a second restriction enzyme (e.g. a 4 base 
35 cutter) which statistically cleaves at least 50% of said 
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sub fragments into 3 further subfragments, will also result in 
a series of fragments suitable for later size fractionation. 

Step d) 

The mixture (s) obtained in step c) are then subjected to a 
5 reaction wherein adapter fragments are added to both ends of 
the double stranded cDNA fragments obtained. As mentioned 
above, this part of the procedure is greatly facilitated by 
the cleaved cDNA fragments having protruding "sticky" ends, 
because pre -designed adapter fragments which fit to these 
10 protruding ends can easily be prepared. 

The adapter (or anchor) fragments are added to the cleaved 
fragments in order to obtain "order in chaos" in the sub- 
sequent step. By adding known sequences to the termini of the 
cleaved fragments, one creates targets for specific amplifi- 

15 cation primers which can be designed specifically with the 
aim of amplifying sequences complying to the adapter frag- 
ments. The material thus obtained (primary template) can be 
pre-amplif ied, using primers complementary to the ligated 
adaptor sequences, giving rise to secondary template. The 

20 pre-amplif ication of primary template allows virtually un- 
limited amounts of template to be produced from one RNA 
preparation, avoiding the need for repeated isolations. 

The adaptor sequence is thus selected so as to serve as the 
starting point for DNA polymerisation in e.g. a PCR reaction. 
2 5 The adaptor sequences are constructed in such a way that the 
specific endonuclease sites are not regenerated after liga- 
tion of said adaptor. 

In a preferred embodiment at least one termination fragment 
is also ligated to the 3' -end of single strands of cleaved 
30 cDNA fragments, said at least one termination fragment intro- 
ducing a block against DNA polymerization in the 5'-»3' direc- 
tion setting out from the at least one termination fragment 
and said at least one termination fragment being unable to 



> 



« 

f 



WO 98/51 789 PCT/DK98/001 86 

17 

anneal to any primer of the at least two primer sets in step 
e) during the molecular amplification procedure. 

The above is a very important procedure when combined with 
the use of detection effected by labelled primers in the 
5 amplification step, wherein only one member of the pair of 

primers is labelled whereas the other is designed to split up 
the amplified products according to their base composition 
adjacent to the adapter fragment. One important feature is 
that a single stranded cDNA fragment which has been provided 

10 with a termination fragment will not be amplified, because no 
primers will be able to anneal to the products of a first 
round polymerisation wherein such a fragment was the templa- 
te, see Figure 7. Secondly, the approach opens for the possi- 
bility of removing background "noise" in a subsequent detec- 

15 tion phase. 

Normally, the at least one termination fragment comprises or 
is a chemically modified nucleotide sequence, such as for 
instance a nucleotide sequence which comprises a dideoxy- 
nucleotide in the 3' -end; this termination technique is- well- 

2 0 known from e.g. the chain- termination sequencing technique 

according to Sanger. Under normal circumstances, the dideoxy- 
nucleotide should, according to the invention, be covalently 
attached to the nucleotide strand so as to avoid loss of the 
dideoxynucleotide during subsequent rounds of amplification. 
25 Superior stabilisation is attained if the dideoxynucleotide 
is phosphorylated. 

As mentioned above, the ligation of adapter and/or termina- 
tion fragments to the cleaved cDNA fragments in step d) is 
conveniently achieved by annealing the adapter fragments to 

3 0 sticky ends of the cDNA resulting from the cleavage in step 

c) and subjecting the product to the action of an enzyme 
having DNA ligase activity. Any suitable DNA ligase known in 
the art can be used. 
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Step e) 



Step e) of the method of the invention results in the final 
sorting of the modified cDNA fragments from step d) . As steps 
b) , c) and d) are combined in the broadest embodiment of the 
5 invention, step e) corresponds to step c) of this embodiment 
described above. 



The primers having the structure of formula I (step c) or II 
(step e) are designed so as to selectively amplify synthesi- 
zed double stranded cDNA fragments obtained in step b) or 

10 predefined subsets of the adapted fragments obtained in step 
d) . A number of ways this can be done may be envisaged, but 
the main strategy is to prime amplification in a series of 
separate reactions where the nucleotide sequence of one 
primer in one reaction ensures that the amplified products of 

15 that reaction are different from those obtained from any of 
the other reactions and that all the reactions result in 
amplification of all fragments obtained from step b) or d) , 
respectively . 

Even though the at least one set of amplification primers of 

2 0 formula I or II wherein has a nl which is a 0, it is prefer- 

red that nl=l, nl=2, nl=3, or nl»4 in one of the primers, 
because the number of primer fragments to be used in the 
reactions in order to cover all possible nucleotide stretches 
adjacent to the Con or adapter fragment is easily manageable. 
25 For instance, if nl=5, it would be necessary to use 4 5 =1024 
different primers in order to obtain amplification of all 
possible nucleotide sequences adjacent to the relevant adap- 
ter fragment, and since the preferred embodiment of the 
invention requires that each such primer is used in a sepa- 

3 0 rate reaction, the work involved would be problematic. 



It is also preferred that in one of the primers n=0, and it 
is especially preferred that this primer is labelled, in 
order to facilitate determination of the amplified fragments. 
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Hence, in the most preferred embodiments of the invention, 
the adapted cDNA fragments are amplified in a number of sepa- 
rate reactions wherein a labelled primer is used (which is 
normally identical in all reactions) and at least one non- 
5 labelled primer which is a member of the set of primers 

described above where n>l. It is preferred that this set of 
amplification primers of formula I or II wherein nl>0 compri- 
ses all possible combinations and permutations of A, G, T, 
and C in the group N nl , since this will ensure that all 
10 possible cDNA fragments can be amplified by the set* 

Hence, the ligated cDNA fragments are sub-divided into a 
number of pools prior to the molecular amplification in step 
e) , and each pool is subjected to the amplification using a 
subset of the set of amplification primers, and in the most 

15 preferred embodiment the ligated cDNA fragments of step d) 
are subdivided into 4 nl pools which are each subjected sepa- 
rately to step e) wherein is used one amplification labelled 
primer as described above (nl=0) and one primer from the set 
of amplification primers as defined above (n>0) , said one 

20 primer being distinct from any one of the primers used for 
amplifying any of the other pools. By using this approach, 
the originally reverse transcribed and cleaved cDNA fragments 
are subdivided into 4 nl pools which can each be subjected to 
further steps. 

25 Further steps and applications 

The material obtained from the above -described series of 
reactions can now be utilised in a number of ways. Normally, 
a further step of separating amplified fragments obtained 
from the molecular amplification procedure is performed. This 

30 yields a mixture of amplified fragments which are separated 
e-£f- by size separation, by mobility in a gel electrophoresis 
or by any suitable chromatographic method. Furthermore, a 
step of identification (e.g. by visualization of these sepa- 
rated fragments) is normally carried out for "book-keeping 

35 purposes"; the separated mixture of fragments will normally 
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be compared to some kind of reference which may be material 
derived from the same or another cell type. 

Visualization of the separated fragments can, as mentioned 
above, be achieved by one of the primers in the amplification 
5 reaction being labelled, but other methods are of course 

available. For instance, a specifically labelled probe which 
e.g. binds to one of the adapter sequences will visualise the 
fragments, but also labelled nucleotides which have been 
incorporated in the fragments during the amplification pro- 
10 cedure (e.g. a PCR) will of course be a suitable means for 
detection (e.g. by incorporating radioactive or fluorescent 
alpha dNTP into the cDNA fragment during PCR, where N = A, C, 
T, U or G) . 

However, it is preferred that visualisation of specific RNA 

15 Derived Fragments (RDFs) is achieved using primers which are 
radioactively or f luorescently labelled and are homologous to 
the adaptors. The comparatively high annealing temperatures 
(touch-down from 65°C to 56°C) which are preferably used 
ensure that polymerisation events will predominantly origina- 

2 0 te from perfect priming of adapter sequences and adjacent 

selective bases. Band intensities are largely a function of 
initial template concentration, whereas band intensities of 
the original Differential Display methods are dependent on 
the quality of the match between the individual template and 

25 primer. The visualisation of rare mRNAs using the present 

inventive methods will be less hampered by the over- represen- 
tation of signal from highly abundant mRNAs. As in the case 
of arbitrary priming, the mismatch amplification and abundant 
RDFs always out -compete the amplification of rare fragments 

30 base pair perfectly. Our experiments suggest that as few as 
100 molecules can be routinely detected in a given template. 
This corresponds to less than 1 transcript per cell in the 
original tissue. 

One interesting part of the invention relates to the use of 
35 the above -described methods in bioinf ormatics . In short, 
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known DNA sequences are inputted into a computer database, 
and on the basis of such sequences a comparison with a real- 
life run of the above -described methods can be performed. In 
this way, bands in a gel obtained from the methods of the 
5 invention can be unambiguously identified with respect to 

sequence, origin and even functionality. Hence, this part of 
the invention pertains to a method for determining the pre- 
sence of an expression product in a cell or group of cells, 
the method comprising providing an RNA- containing sample from 

10 the cell or group of cells and subjecting the sample to the 
method described above, and thereafter performing a compari- 
son of the thus identified amplified cDNA fragments with a 
database output, said database output comprising a computer- 
generated list of molecular weights of restriction DNA frag- 

15 ments of known sequences, said list being prepared by 



20 - subsequently simulating cleavage of the virtual DNA 



inputting and storing DNA sequence data in a database as 
virtual DNA sequences (these can be obtained and updated 
regularly from any database containing information about 
gene sequences from the relevant organism or cell type) , 



25 



35 



30 



sequences with the at least one restriction nuclease and 
storing the resulting simulated cleavage products as 
virtual cleaved DNA fragments (such simulation is relati- 
vely uncomplicated, since the recognition and cleavage 
patterns of a large number of restriction enzymes are 
already known) , 

simulating ligation to the virtually cleaved fragments of 
the at least two adapter fragments and storing the re- 
sults as virtually ligated DNA fragments (again, this 
merely requires that input is provided of the structure 
of adapter fragments used in the real-life process), 
for each individual combination of primers used in step 
e) grouping the virtually ligated DNA fragments suscep- 
tible to amplification by said combination of primers in 
the same group, 
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determining, in each group, the absolute and/or relative 
molecular weight of each virtually ligated DNA fragment, 
and 

outputting the content of each group in the form of a 
5 list comprising the absolute and/or relative molecular 

weights of the virtually ligated fragments in the group. 

It is preferred that a link is maintained between each member 
of the output list and the original sequence from which such 
a member has been derived. This can e.g. be done by linking 

10 the input DNA sequence data to data relating to the genetic 
origin of the DNA sequence data and optionally to data rela- 
ting to functional features relating to the genetic origin 
and thereafter maintaining the information as a pointer back 
in the system to said sequence. Hence, the output indication 

15 will conveniently further comprise information about the 
genetic origin of the virtually ligated DNA fragment and 
optionally information about functional features associated 
with the genetic origin. 

For ease of use of such a bio- inf ormatic system, it is nor- 
20 mally necessary that 1) either the comparison is performed by 
inputting the identified amplified cDNA fragments in a format 
which allows automated comparison with the database output, 
or 2) the database output is outputted in a format which 
allows for direct comparison between the separated amplified 
25 cDNA fragments and the database output. For instance, if the 
visualized and separated cDNA fragments from step e) have 
been run on a gel, it will be possible to either read a 
digital reproduction of the gel pattern into the computer and 
let the computer compare this input with the computer gene- 
30 rated pattern, or alternatively, to output the computer 
generated pattern in such a manner that it resembles an 
electrophoresis gel pattern. 

Another part of the invention pertains to the use of the 
inventive method for comparing expression levels in different 
35 cells. One way of doing this is to determine the change in 
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expression, compared to the expression in a reference cell or 
reference group of cells, of an expression product in a cell 
or group of cells which has been subjected to a first set of 
conditions influencing the expression pattern of said cell or 
5 group of cells, said reference cell or group of cells being 
subjected to a second set of conditions, the method compri- 
sing providing an RNA- containing sample from the cell or 
group of cells and subjecting the sample to the method of the 
invention for sub-division, thereby obtaining data describing 

10 the amplified cDNA fragments derived from the sample, pro- 
viding reference data describing amplified cDNA fragments 
derived from an RNA- containing reference sample from the 
reference cell or reference group of cells, the reference 
data being obtained by having previously subjected the refe- 

15 rence sample to the method of the invention, subsequently 

performing a comparison of the data and the reference data to 
identify the cDNA fragments which are expressed at different 
levels in the two data sets, and thereafter using the diffe- 
rentially expressed cDNA fragments to determine which expres- 

20 sion products are subject to a change in expression level. In 
other words, the method of the invention is carried out twice 
on the basis of two different RNA samples derived from cells 
subjected to differing conditions. 

Normally, the data and reference data are selected from the 
25 group consisting of the apparent molecular weights of the 
amplified DNA fragments, the M r of the amplified DNA frag- 
ments, the absolute amount of the amplified DNA fragments, 
and the relative amounts of the amplified DNA fragments. The 
reference data can further be extracted from a database 
3 0 containing the reference data defined above and optionally 
further information relating to the genetic origin of each 
amplified cDNA fragment from the reference. 

Related to the above, the invention also allows for diagnosis 
of disease which is characterized by a deviating (increased 
35 or reduced) expression level of at least one expression pro- 
duct in at least one cell type, the method comprising pro- 
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viding an RNA- containing sample derived from the at least one 
cell type, subjecting the sample to the method of the inven- 
tion thereby obtaining data describing the amplified cDNA 
fragments derived from the sample, providing reference data 
5 describing amplified cDNA fragments derived from a RNA- con- 
taining reference sample derived from the same type of cell 
from a subject not suffering from the disease, the reference 
data being obtained by having previously subjected the refe- 
rence sample to the method according to the invention, and 

10 subsequently performing a comparison of the data and the 

reference data with respect to those cDNA fragments which are 
known to be related to the disease, and assessing whether a 
significant difference in the data and reference data exists 
so as to establish whether the expression level of the ex- 

15 pression product deviates or not. 

As for the embodiment above, also here the data and reference 
data are selected from the group consisting of the apparent 
molecular weights of the amplified DNA fragments, the M r of 
the amplified DNA fragments, the absolute amount of the 

20 amplified DNA fragments, and the relative amounts of the 

amplified DNA fragments, and also here the reference data can 
be extracted from a database containing the reference data 
defined above and optionally further information relating to 
the genetic origin of each amplified cDNA fragment from the 

25 reference. 

Further, the invention provides a method for treatment of a 
disease which is characterized by a deviating (increased or 
reduced) expression level of at least one expression product 
in at least one cell type, the method comprising providing an 

3 0 RNA- containing sample derived from the at least one cell 

type, subjecting the sample to the method of the invention 
thereby obtaining data describing the amplified cDNA frag- 
ments derived from the sample, providing reference data 
describing amplified cDNA fragments derived from a RNA- con - 

35 taining reference sample derived from the same type of cell 
from a subject not suffering from the disease, the reference 
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data being obtained by having previously subjected the refe- 
rence sample to the method according to the invention, and 
subsequently performing a comparison of the data and the 
reference data with respect to those cDNA fragments which are 
5 known to be or suspected of being related to the disease, and 
assessing whether a significant difference in the data and 
reference data exists so as to establish whether the expres- 
sion level of the expression product deviates or not. 

If the expression product is reduced, the disease may be 
10 treated by delivering the expression product; if the expres- 
sion product is increased, the disease may be treated by 
delivering an inhibitor (e.g. an antibody) against the ex- 
pression product. The scope of the present invention includes 
an expression product identified by the method of the inven- 
15 tion as such as well as methods for treating a disease which 
method has been provided by means of the method of the inven- 
tion . 

The mixtures of amplified fragments obtained from step e) of 
the method of the invention may also be used for preparing a 
20 surface (chip) coated with cDNA fragments. This can be done 
by 

subjecting an RNA- containing sample to the subdivision 
method of the invention including separation steps, and 

transferring the separated amplified cDNA fragments to a 
25 chip surface adapted to stably bind the separated ampli- 

fied cDNA fragments while maintaining the spatial relati- 
ve distribution pattern thereof. 

Alternatively, such a chip can be prepared by 



30 



subjecting an RNA- containing sample to the method of the 
invention without performing the separation, and there- 
after 
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separating, by electrophoresis, the thus obtained ampli- 
fied cDNA fragments on a particular surface adapted- to 
stably bind the separated amplified cDNA fragments while 
maintaining the relative distribution pattern after 
5 electrophoresis. In this embodiment, the electrophoresis 

is preferably in the form of microelectrophoresis. 

Transfer to the surface is preferably accomplished by a 
electrophoretic blotting technique, and/or by well-known 
photo-activated organic or inorganic chemistry coupling 
10 techniques. 

The invention also pertains to a surface obtainable by the 
above-mentioned method for the preparation thereof. Such 
surfaces are considered novel and inventive, since known "DNA 
chips" rely on specific introduction of an array of nucleic 
15 acid fragments of known structure, whereas the present method 
provides for "a semi-array" containing cDNA fragments charac- 
terizing a specific "situation" for a specific cell type, 
according to Figure 8, 

Such a surface can i.a. be used for screening for genes 
within a gene family. The "array chip" is provided and there- 
after a labelled probe (which is a representative of a gene 
family) is allowed to hybridize to the chip under low strin- 
gency i.e. under conditions as described at pages 94-106 in 
"Nucleic acid hybridisation. A practical approach" edited by 
BD Hames & S J Higgins, IRL Press. A number of fragments 
coupled to the chip will hybridize to the probe, and these 
fragments can subsequently be identified, isolated and se- 
quenced/characterized in order to determine whether they are 
representatives of the same gene family. 

3 0 Another use of such "semi -arrays" is for determining the dif- 
ference in expression pattern between a first cell or type of 
cells and a second cell or type of cells, the method compri- 
sing providing samples of labelled RNA or cDNA from the first 
and second cells or cell types and subsequently contacting 
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each of these samples with a chip surface as described above , 
and subsequently detecting the amount and distribution of 
bound labelled RNA or cDNA from each sample. 

Under all circumstances, the chip surface with the cDNA bound 
5 thereto can e.g. be produced by the methods described in EP-0 
654 061. 

Yet another part of the invention pertains to a method for 
screening for interactions between a pre- selected protein and 
a polypeptide fragment, the method comprising preparing a 

10 sub-divided library of amplified cDNA fragments resulting 
from step e) , optionally adapting the terminals of the mem- 
bers of the library so as to facilitate insertion in a vec- 
tor, inserting the fragments into vectors, transforming a 
population of suitable host cells with the vectors, culturing 

15 the host cells under conditions which enable expression of 
correctly inserted cDNA fragments by the host cell, and 
subsequently assaying polypeptide fragments encoded by the 
inserted cDNA fragments for interaction with the pre -selected 
protein. 

2 0 One convenient way of achieving this is by way of a two- 

hybrid technique, wherein the host cells are eukaryotic cells 
(such as fungal cells, especially yeast cells) which are 
mated or transfected with nucleic acid material encoding the 
pre-selected protein, successful mating/transf ection of the 

25 cell(s) resulting in a cell or cells wherein the interaction 
between the pre-selected protein and a polypeptide fragment 
gives rise to a detectable signal. 

Such methods have recently attracted a great deal of atten- 
tion, i.a. as a consequence of the disclosure in Fromont- 
30 Racine et. al . , Nature Genetics 16, 277-282 (1997), which is 
incorporated by reference herein. 



One convenient system for providing the detectable signal is 
by use of Green Fluorescent Protein, disclosed in EP-A-0 569 



> 

> 



WO 98/51789 PCT/DK98/00186 

28 

170, wherein changes in fluorescent spectrum due to inter- 
actions are used as reporter. 

Finally, the invention pertains to a composition for use in 
reverse transcription of RNA, the composition comprising 

5 a) a first enzyme having reverse transcriptase activity 

at temperatures not exceeding 55 °C 

b) a second enzyme having reverse transcriptase activity 
at elevated temperatures in the range of 45°C - 95°C (and 
especially the temperatures discussed above for perform- 
10 ing reverse transcription at elevated temperatures) # 

said second enzyme having a substantially higher activity 
than said first enzyme in catalyzing reverse transcription at 
said elevated temperatures. It is preferred that the first 
enzyme has a substantially higher activity than said second 
15 enzyme in catalyzing reverse transcription at said tempera- 
tures not exceeding 55 °C, and it is also preferred that the 
second enzyme has a substantially higher activity than said 
first enzyme in catalyzing reverse transcription at said 
temperatures exceeding 45 °C. 

20 DESCRIPTION OF THE PREFERRED EMBODIMENTS 

First, the drawing will be briefly described. 

Fig. 1 

Basis of Display Of Differentially Expressed Transcripts. 
Fig. 2 

25 Anchor and PCR primer design. 
Fig, 3 

An autoradiogram of a DODET gel using the cellular set-up 
described in Example 1; rat pheochromocytoma PC12 cells were 
stimulated with the Nerve Growth Factor (NGF) and Epidermal 
3 0 growth factor (EGF) . 
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Lanes 1-24, reverse transcription using the anchored poly T 
primer 5'-T 25 AA-3' 

Lanes 25-48, reverse transcription using the anchored poly 
T primer 5'-T 25 GC-3' 



5 Lanes 1, 5, 9, 13, 17, 21, 25, 29, 33, 37, 41, 45 represent 
the PC12 cells not treated. 



Lanes 2, 6, 10, 14, 18, 22, 26, 30, 34, 38, 42 and 46 repre- 
sent the PC12 cells treated with the NGF factor for 60 minu- 
tes . 



10 Lanes 3, 7, 11, 15, 19, 23, 27, 31, 35, 39, 43 and 47 repre 
sent the PC12 cells treated with the NGF factor for 90 minu 
tes 

Lanes 4, 8, 12, 16, 20, 24, 28, 32, 36, 40, 44 and 48 repre 
sent the PC12 cells treated with the EGF factor for 90 minu 
15 tes. 



Lanes 1-48, using the following pairs for the pre-PCR 
amplifications : 

TagI pre-amplif ication primer: 5 ' - CAGCATGAGTCCTGACCGA 
Sell pre-amplif ication primer: 5 ' - CTCGTAGACTGCGTACCGATCA 

20 For the second PCR amplification the following primer pairs 
were used: 
Lanes 1-4 

5 ' - C ATG AGTC CTG AC CG AA 

5 ' - GACTG CGTACCGAT C AA ( 5 ' end label 1 ing ) 

25 Lanes 5 - 8 

5 ' - CATGAGTC CTGACCGAA 

5 ' - GACTGCGTACCGATCAC ( 5 ' end label 1 ing ) 

Lanes 9-12 
5 ' - CATGAGTC C TG AC CGAA 
30 5 ' - GACTGCGTACCGATCAG (5' end labelling) 



WO 98/5 1 789 PCT/DK98/00 1 86 

30 

Lanes 13 - 16 

5 ' - CATGAGTCCTGACCGAA 

5' -GACTGCGTACCGATCAT (5' end labelling) 



Lanes 17 - 2 0 
5 5 ' - CATGAGTCCTGACCGAC 

5 ' - GACTGCGTAC CGATCAA ( 5 ' end label 1 ing ) 

Lanes 21 - 2 4 

5 ' - CATGAGTCCTGACCGAC 

5 ' -GACTGCGTAC CGATCAC (5' end labelling) 

10 Lanes 25 - 48 

Repeated primer combination from lanes 1-24 

Fig. 4a 

Northern Blot of RDF 01 sequence from cellular total RNA. 
a) PC12 cells not treated, b) NGF treatment for 60 minutes, 
15 c) NGF treatment for 9 0 minutes, and d) EGF treatment for 9 0 
minutes . 

Fig. 4b 

Loading control, RNA extracts were electrophoresed on a 1.2% 
agarose gel containing ethidium bromide, used as a control to 
20 determine the relative concentration of RNA in each lane, a, 
b, c, d same as in Figure 4a 

Fig. 4c 

Northern Blot of RDF02 sequence from cellular total RNA. 
a) PC12 cells not treated, b) NGF treatment for 60 minutes, 
25 c) NGF treatment for 90 minutes, and d) EGF treatment for 9 0 
minutes . 



Fig. 4d 

Loading control, RNA extracts were electrophoresed on a 1.2% 
agarose gel containing ethidium bromide, used as a control to 
3 0 determine the relative amount of RNA in each lane, a, b, c, d 
same as in Figure 4c 
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Fig. 5 

Searching for genes modulated by a growth factor. 



Lane 1 
Lanes 2-6 



Lanes 7-11 



Lanes 12-16 



10 Lanes 17-21 



Lanes 22-26 



where N. 



nl 



where N. 



m 



where N. 



Size marker in bp (150 bp, 200 bp, 250 bp) . 
Amplification primer 5 ' -Com-N nl -3 ' where N nl 
is GAA 

Amplification primer 5*-Com-N nl -3 
is GAC 

Amplification primer 5'~Com-N nl -3 
is GAG 

Amplification primer 5 ' -Com-N nl -3 
is GAT 

Amplification primer 5 ' - Com-N nl - 3 ' where N nl 
is GCA 

In lane 11 a downregulation is observed after 6 days treat- 
ment, whereas in lane 16 an upregulation is observed after 6 
days treatment. Both modulations are due to the growth fac- 
tor, since regulation is seen only when the active growth 
factor is present. 



Fig 6 

20 Searching for genes involved in bacterial resistance. 

Lane 1: Size marker in bp (150 bp, 200 bp, 250 bp, 

300 bp) . 

Lanes 2-5 Amplification primer 5 ' -Com-N nl -3 ' where N nl 

is GAA 

25 Lanes 6-9 Amplification primer 5 ' - Com-N nl - 3 ' where N nl 

is GAC 

Lanes 10-13 Amplification primer 5 ' -Com-N nl -3 ' where N nl 

is GAG 

Lanes 14-17 Amplification primer 5 ' -Com-N nl -3 ' where N nl 
3 0 is GAT 

Lanes 18-21 Amplification primer 5 • -Com-N nl -3 ' where N nl 

is GCA 

Lanes 22-25 Amplification primer 5 9 ~Com-N nl -3 ? where N nl 

is GCC 



35 In lanes 8-9 a downregulation is observed, in lanes 20-21 an 
upregulation is observed. Both gene modulations are potential 
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genes involved in the resistance to the Baeteriamycin, Ino- 
sin. 

Fig. 7 

Principle of the technology used in Examples 4 and 5. 

5 After ds-cDNA synthesis the DNA is digested with one 4 base 
pair endonuclease and anchors are ligated to the ds-cDNA 
ends* Using special design primers the expression profiles 
are obtained by amplifying the mRNAs in different expression 
windows ( sub- fractions) . The number of expression windows 
10 depends on the complexity of the sample i.e. 64 expression 
windows in eukaryotic. 

Fig. 8 

Principles of a gene discovery DNA surface (a DNA chip) . 

After size separation of the DNA fragments, the DNA fragments 
15 are transferred to a nylon membrane using an electrophoretic 
principle. The membrane is hybridized with a complex DNA 
probe generated using the principle of the invention. Alter- 
natively the membrane can be hybridized with one single gene 
to identify new members of a particular gene family. The 
20 membrane are in the x coordinates separated in 64 expression 
windows, and in the y coordinates separated in base pair size 
(from 50 base pair to 1200 base pair) according to principle 
described in figure 7. 

Fig. 9 

25 Principle of generation 64 pools of 3' END cDNAs 
Step 1 

Production of single stranded cDNA using 5 1 - con^^V oligo- 
nucleotide where con-L is an oligonucleotide between 1-100 
nucleotide, n is between 5-40 and V is a mixture of A, C and G. 
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Step 2 

Double stranded cDNA synthesis are produced using 5 1 Con 2 N x 
where con 2 is an oligonucleotide between 1-100 nucleotide, x is 
between 1-10, and N is a mixture of A, C, T and G. The ds-cDNA 
5 synthesis is synthesized by Klenow enzyme with the above- 
described oligonucleotide. 

Step 3 

Pre -amplification of double stranded cDNA to amplify the double 
stranded cDNA, the cDNA is PCR amplified using a combination of 
10 con x and con 2 primers. 

Step 4 

The pre-amplif ied cDNA is further amplified and separated in 64 
pools using a combination of a labeled co^ and 64 con 2 NNN 
primers in a PCR amplification procedure, where NNN are 
15 combined in 64 different ways using the nucleotides A, T, G and 
C. 

Step 5 

Each of the 64 pools is separated using the Page electro- 
phoresis principle . 

2 0 EXAMPLES 

In order to verify the functionality of the invention, examp- 
les are described below in which a developmental eukaryotic 
cellular system, pheochromocytoma PC12 , was employed. 

Nerve Growth Factor (NGF) induces growth arrest and neurone 
25 outgrowth in the in vitro PC12 cell system. Other growth 
factors, such as epidermal growth factor (EGF) , support 
survival and stimulate growth. NGF- induced genes, include the 
immediate early genes, which encode transcription factors, 
such as c-fos and c-myc. The products of the immediate early 
30 genes are thought to be involved in regulating the expression 
of genes, associated with the neuronal phenotype for example 
neurofilaments, peripherin, GAP 4 3 and transin. 



I 
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In order to identify new early genes involved in neuronal 
differentiation and proliferation, the following DODET method 
is used for identify such genes. 

In the following examples, it is demonstrated how efficiently 

5 the method of the invention can be applied to such cellular 
systems . 



EXAMPLE 1 



The rat pheochromocytoma PC12 cells were grown (in vitro) in 
the presence and absence of Nerve Growth Factor (NGF) and 
10 epidermal growth factor (EGF) under growth conditions descri- 
bed elsewhere (Saltiel et a2. 1996). 



The total RNA was isolated using the standard single -step 
method by Chomczynski and Sacchi according to Sambrook et al 
1989 . 



15 Total RNA concentration was determined spectrophotometrically 
and then adjusted to 0.2 jxg/jxl . This RNA was used directly in 
the Northern analysis. 

For DODET 4 x 0 . 5 fxg total RNA was reverse transcribed in 
separated pools using the primer 5'-T 25 AA-3'. The same pro- 
20 cedure was performed using the 5'-T 25 GC-3' poly-dT anchored 
primers, giving a total of 2 x 4 x 0.5 of RNA. 



First strand synthesis 



20.0 fil total RNA amount between 0.3 to 1.0 fig RNA 

3.0 /xl 5'-T 25 AA-3' Cone. 100 ng/jiel or 5 , -T 25 GC-3' Cone. 

25 100 ng//il 

5.0 /-tl 10 x cDNA buffer (buffer B from Epicentre Techno 

logies # R19250) 

2.0 jjlI dNTPs (25 mM from Pharmacia Biotech) 

1.0 ill Superscript II RT (200 U/jwl) (Gibco BRL # 18064- 

30 014) 



I 

1 
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5.0 pi Retrotherm RT (1 U//>tl) (Epicentre Technologies 

#R19250) 
14.0 pi H 2 0 

To obtain high specificity, the cDNA reaction was incubated 

5 at 50 °C for 30 minutes followed by 1 hour incubation at 70 °C. 



Second strand synthesis: 



To the first strand reaction, add the following components 



15.0 fil 10 x CDNA buffer 

3.0 fil Hybridase Thermostable RNase (1 U/ptl) (Epicentre 

10 Technologies #H39050) 

1.0 fjtl rBst thermostable DNA polymerase (1 U/ftl) ) (Epi- 

centre Technologies #BH1100) 
81.0 fil H 2 0 



Incubate at 65°C for 1 hour. 



15 The resulting double stranded cDNA was phenol extracted and 
precipitated and resuspended in 20 fil of H 2 0. Half of this 
volume was checked on gel; if a smear between 100 bp and 3000 
bp was observed, the rest of the cDNA was used for DODET 
template production. The resulting cDNAs were digested with 

20 10 U of each of the thermostable restriction enzymes TaqI and 
Bell at 50 °C for 2 hours. To this mixture, DODET adapters 
were added and ligated to the ends of the restriction frag- 
ments with T 4 DNA ligase (1U) resulting in the primary tem- 
plate. 8-15 cycles of non- radioactive pre-amplif ication, 

25 using primers complementary to the DODET adapters, were 

performed on a small aliquot (l/l0 th volume) of the primary 
template (94°C denaturation; 30 s, 56°C annealing; 30 s, 72°C 
polymerisation; 1 min) . The products of the amplification 
(termed secondary template) were also checked on a 1.5% 

30 agarose gel. As expected, fragment sizes were predominantly 
between 100 bp and 1000 bp. All amplification reactions were 
carried out on a PE-9600 thermocycler using Tag DNA-polyme- 



• 
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rase, both from Perkin Elmer Corp. (Norwalk, CT, USA) . The 

final template was then diluted 10 fold with H 2 0. 

The adapters ligated to the restriction fragments, the pre- 
amplification and active PCR are given below: 

5 TagI adapter: 5 ' - CAGCATGAGTCCTGAC 

TACTCAGGACTGGC- 5 ' 

TagI pre-amplif ication primer: 5 ' - CAGCATGAGTCCTGACCGA 

TagI amplification primer: 5 1 - CATGAGTCCTGACCGAN 

(N = A or C or G or T) 

10 BcJI adapter: 5 ' - CTCGTAGACTGCGTACC 

CTGACGCATGGCTAG - 5 ' 

Bell pre-amplif ication primer: 5 ' - CTCGTAGACTGCGTACCGATCA 

Bell amplification primer: 5 ' - GACTGCGTACCGATCAN 

(N = A or C or G or T) 

15 For PCR all the different combinations of one extension 

(denoted as N above) were available, giving a total of 4 2 
primer combinations. All oligonucleotides were obtained from 
DNA Technology (Aarhus, Denmark) . 

Radioactive labelling of the Bell primer was performed using 
20 1U of T 4 polynucleotide kinase. Thermocycling was carried out 
essentially as described above but with 35 cycles and includ- 
ing an 11 cycle touch-down {the annealing temperature was 
reduced from 65°C to 56°C in 0.7°C steps for 11 cycles and 
subsequently maintained at 56°C for 23 cycles) . Samples were 
25 then boiled after the addition of dye and 50% formamide and 
separated on a 5% polyacrylamide sequencing type gel (GIBCO 
BRL Life Technologies Inc., Gaithersburg, MD, USA). All gels 
were run at standard conditions, such that the 70 bp marker 
was 3 cm from the bottom of the gel, giving good resolution 
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between 70-800 bp. Gels were then dried directly onto Whatman 
3M paper on a slab gel dryer. Labelled DNA fragments were 
visualised by autoradiography. Gels and films were positio- 
nally marked prior to development. The 1 base selective ex- 
5 tensions were chosen empirically to yield approximately 50 
radioactively labelled fragments per lane* 

Bands, identified on the autoradiogram as interesting, were 
lined up with markings on the film and the dehydrated gel and 
were excised. Excised fragments were monitored for activity. 

10 The gel fragments were isolated using GENE CLEAN (BIO101, 
California USA) . DNA was then recovered according to the 
manufacturer's recommendations. DNA fragments could then be 
reamplified using the same PCR conditions and primers as used 
in the initial PCR; however, 15 cycles generally yielded 

15 sufficient product for cloning. Cloning was achieved using 

unpurified PCR product and the vector display-pl23T (Display 
Systems Biotech, USA) . Conditions were used as recommended by 
the manufacturer. 

EXAMPLE 2 

20 Figure 3 shows a typical DODET gel produced by amplification 
of template derived from treatment of PC12 cells with NGF or 
EGF. 

Total RNA was reverse transcribed with the 5 , -T25AA-3 , and 
T25GC-3 1 poly-dT anchored primers, and after anchor ligation 
25 pre-amplif ication with Bell and TaqI pre-amplif ication primer 
pairs was performed. 

6 out of 16 possible primer combinations are shown, using 1 
selective base, at each restriction enzyme site (Figure 3) . 
The largest visible products (Figure 3) are approximately 
30 1000 bp in size and the lower end of the gel corresponds to 
approximate 100 bp. In this size window an average of 50 
bands can be scored for each primer combination. In Figure 3, 
various expression patterns can be detected. 
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Due primarily to the stringent conditions possible in DODET, 
resolution of the banding pattern is high while the level of 
background remains at acceptable levels (Figure 3) . Further- 
more, quite radical changes in the intensity of individual 
5 bands over the treatment period do not seem to affect the 
patterns of other bands in the same lane. 

It is, therefore, possible to conclude that the PCR remains 
proportionally independent on the concentration of individual 
substrates in the reaction. 

10 The use of an optimised combination of standard protocols 
described above for isolating, re- amplifying and cloning 
individual RDFs , has allowed the identification of a number 
of transcripts associated with differentiation and prolifera- 
tion events. 

15 Four RDFs were isolated for further analysis, Figure 3, bands 
a, b, c and d. 

Sequence analysis revealed that RDF a = RDF b and RDF c = RDF 
d, as illustrated in Figure 3. 

In all cases appropriate terminal sequences with the correct 
20 1 selective base extensions used in the PCR could be retriev- 
ed, demonstrating the stringency and fidelity of the system 
(data not shown) . 

EXAMPLE 3 

During scanning of the PC12 cellular systems treated with NGF 
25 or EGF with different primer combinations, two RDFs (designa- 
ted RDF01 and RDF02) exhibiting a differential expression 
during the NGF treatment were isolated (RDF01 = RDF a = RDF b 
and RDF02 = RDFc = RDF d. in Figure 3). 

After re -amplification, sub- cloning and DNA sequencing, 
3 0 further DNA analysis revealed two unknown RDFs upregulated 
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after 60 minutes NGF treatment or 90 minutes EGF treatment in 
the PC12 cellular system. The nucleotide sequences of both 
RDF01 and RDF 02 show less than 10% homology to any existing 
gene in the GeneBank or EMBL databases . 

5 The expression of RDF01 and RDF 02 was further analyzed using 
Northern blot (Figure 4a - 4d) . Here, transcripts could 
clearly be detected at 60 minutes NGF treatment or 90 minutes 
EGF treatment of the PC12 cells, confirming the results 
obtained using the DODET method, as illustrated in Figure 3. 

10 Experiments to clone the full length of RDF01 and RDF02, and 
biological characterisation of their involvement in the 
differentiation and proliferation, of the PC12 cellular 
system are currently under investigation. 

EXAMPLE 4 

15 Searching- for genes modulated by a growth factor. 

A human cell line was treated with a growth factor and RNA 
was isolated a various time points as indicated below. 

1 Cell without any treatment (lanes 2, 7, 12, 17, 22) 

2 Cell treated with helper agent, 1 day (lanes 3, 8, 13, 
20 18, 23) 

3 Cell treated with helper agent and growth factor, 1 day 
(lanes 4, 9, 14, 19, 24) 

4 Cell treated with helper agent, 6 days (lanes 5, 10, 
15, 20, 25) 

25 5 Cell treated with helper agent and growth factor, 6 

days (lanes 6, 11, 16, 21, 26) 

Human RNA was isolated and the gene discovery analysis was 
performed essentially as described in the legend to Fig. 3. 

5 out of 64 amplification primers are shown in Fig. 5, each 

3 0 covering a certain portion of the mRNA pool in the human cell 
line. The expression analysis was performed on an ALFexpress, 



WO 98/51789 PCT/DK 98/00 186 

40 

an automated fragment analyzer from Pharmacia Biotech, using 
a Cy5 label . 

In lane 11 a downregulation is observed after 6 days of 
treatment. In lane 16 an upregulation is observed after 6 
5 days of treatment. Both modulations are due to the growth 
factor, since regulation is only observed with the active 
growth factor present. 

EXAMPLE 5 

Searching for genes involved in bacterial resistance to 
10 antibiotics . 

A Listeria monocylogia strain was treated with the Bacteria - 
mycin, Inosin. RNA from a strain resistant to Inosin was 
further investigated, 

1. Bacterial clone 1 without any treatment {lanes 2, 6, 

10, 14, 18) 

2. Bacterial clone 2 without any treatment (lanes 3, 7, 

11, 15, 19) 

3. Bacterial clone 3 resistant to Inosin (lanes 4, 8, 12, 

16, 20) 

4. Bacterial clone 4 resistant to Inosin (lanes 5, 9, 13, 

17, 21) 

Bacterial RNA was isolated by standard techniques and the 
gene discovery analysis was performed according to Example 4 
and Fig. 5, with the exception that a 5 ' - NNNNNNYYA primer was 
25 used for first strand synthesis. 

6 of 64 amplification primers are shown in Figure 6, each 
covering a certain portion of the mRNA pool in the prokaryo- 
tic cell system. The expression analysis was performed on an 
ALFexpress, an automated fragment analyzer from Pharmacia 
30 Biotech, using a Cy5 label. 



15 



20 
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In lanes 8 and 9 a downregulat ion is observed and in lanes 20 
and 21 an upregulation is observed. Both gene modulations are 
potential genes involved in the resistance to Inosin. 
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CLAIMS 

1. A method for preparing a normalized sub-divided library of 
amplified cDNA fragments from the coding region of mRNA 
contained in a sample, the method comprising the steps of 

5 a) subjecting the mRNA derived from the sample to reverse 

transcription using at least one cDNA primer having the 
general formula 

S'-Con^dT^-V^-N^-S' 

wherein Con a is any sequence between 1-100 nucleotides, 
10 dT is deoxythymidinyl, V is A, G or C, N is A, G, C or T, 

n2 is an integer > 1, n3 is 0 or 1 , if n3 is 0 then n4 is 
0, and if n3 is 1 n4 is an integer s 0, thereby obtaining 
first strand cDNA fragments, 

b) synthesizing second strand cDNA complementary to the 
15 first strand cDNA fragments by use of the first strand 

DNA fragments as templates, and a second cDNA primer with 
the general formula 

5 7 -Con 2 -N x _3 ' 

wherein Con 2 is any sequence between 1-100 nucleotides 
20 and can be different or identical to con^ N x is A, G, T 

or C, and x is an integer s 0, in a appropriate enzyme/- 
buffer solution which comprises the DNA pol I enzyme or 
the Klenow fragment of the DNA pol I enzyme, all four 
deoxyribonucleoside triphosphates and standard buffer and 
25 temperature conditions, thereby obtaining double stranded 

cDNA fragments, and 

c) subjecting the cDNA fragments obtained in step b) to a 
molecular amplification procedure so as to obtain ampli- 
fied cDNA fragments, wherein is used a set of amplifica- 

30 tion primers having the general formula 
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5 ' - Con 3 - N nl - 3 ' 



wherein Con 3 is a sequence identical to either Con-L or 
Con 2 or both, N is A, G, T or C, and nl is an integer a 
0, wherein at least one set of primers has the general 
5 formula I where n > 0, said at least one set being ca- 

pable of priming amplification of any nucleotide sequence 
complementary in its 5' -end to Con x or Con 2 . 



2. A method for preparing a normalized sub-divided library of 
amplified cDNA fragments from the coding region of mRNA 
10 contained in a sample, the method comprising the steps of 

a) subjecting the mRNA derived from the sample to reverse 
transcription using at least one cDNA primer, thereby 
obtaining first strand cDNA fragments, 



b) synthesizing second strand cDNA complementary to the 
15 first strand cDNA fragments by use of the first strand 

DNA fragments as templates, thereby obtaining double 
stranded cDNA fragments, 



c) digesting the double stranded cDNA fragments with at 
least one restriction endonuclease , thereby obtaining 

2 0 cleaved cDNA fragments, 

d) ligating at least two adapter fragments to the cleaved 
cDNA fragments obtained in step c) , so as to obtain 
ligated cDNA fragments, and 



e) subjecting the ligated cDNA fragments obtained in step 
25 d) to a molecular amplification procedure so as to obtain 

amplified cDNA fragments, wherein is used, for an adapter 
fragment used in step d) , a set of amplification primers 
having the general formula 



5' -Com-BL, - 3' 



II 
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wherein Com is a sequence complementary to at least the 
5' -end of an adapter fragment which is ligated to the 3'- 
end of a cleaved cDNA fragment, N is A, G , T, or C, and 
nl is an integer > 0, and wherein at least one set of 
5 primers has the general formula I where nl > 0, said at 

least one set being capable of priming amplification of 
any nucleotide sequence ligated in its 3' -end to the 
adapter fragment complementary in its 5' -end to Com. 

3. A method according to claim 1 or 2 , wherein the mRNA is of 
10 eukaryotic, Archae or prokaryotic origin. 

4. A method according to any of claims 1-3 , wherein the 
reverse transcription is performed under high stringency 
conditions . 

5. A method according to any of the preceding claims, wherein 
15 the reverse transcription is carried out at a temperature in 

the range from about 45 °C to about 95 °C by use of an enzyme 
having reverse transcriptase activity at said temperature. 

6. A method according to claim 5, wherein the enzyme is 
thermostable, such as an enzyme selected from the group 

20 consisting of a DNA polymerase with reverse transcriptase 
activity derived from thermophilic eubacteria, such as Taq 
(Thermus aquaticus) , Stoffel (Thermus aquaticus) , Tht (Ther- 
mus thermophilus) , Tfl/Tub (Thermus flavus) , Tru (Thermus 
Ruber) , Tea (Thermus caldophilus) , Tfil (Thermus f iliformis) , 

25 Tbr (Thermus Brockianus) , Bst (B. Stearothermophilus) , Bca 
(B. Caldotenax YT-G) , Bcav (B. Caldovelox YT-F) , FjSS3-B.l 
(Thermotoga FjSS3-B.l), Tma (Thermus Maritima) , UITma (T. 
Maritima) , Tli (T. Litoralis) , Tli exo- (T. Litoralis) , 9°N-7 
(Thermococcus sp.), BG-D (Pyrococcus sp.), Pfu (P. furiosus) , 

30 Pwo (P. woesei) , Sac (S. Acidocaldarius) , Ssol (S. Solfatari- 
cus) , Tac (T. Acidophilum) , and Mth (Methananococcus Voltae) . 

7. A method according to any of claims 1-4, wherein the 
reverse transcription is carried out at a temperature in the 
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range from about 25 °C to about 55 °C by use of an enzyme 
having reverse transcriptase activity at said temperature. 

8. A method according to claim 7, wherein the enzyme is a 
reverse transcriptase, such as a reverse transcriptase selec- 

5 ted from reverse transcriptase from AMV (Avian Myeloblastosis 
Virus), M-MuLV (murine M-MuLV pol gene), or HIV-l (HIV 
virus) . 

9. A method according to any of claims 1-4, wherein the 
reverse transcription is carried out in two subsequent steps, 

10 the first step comprising carrying out reverse transcription 
as defined in claim 7 or 8, and the second step comprising 
carrying out reverse transcription as defined in claim 5 or 
6 . 

10. A method according to claim 9, wherein reverse transcrip- 
15 tion in the two steps is effected by non- identical enzymes 

having reverse transcriptase activity. 

11. A method according to claim 10, wherein the non- identical 
enzymes are added separately in each step or are present in 
both steps. 

20 12 . A method according to claim 11, wherein the activity of 

the enzyme which is active in the first step is substantially 
abolished in the second step. 

13. A method according to any of claims 9-12, wherein the 
enzyme effecting reverse transcription in the first step is 

25 reverse transcriptase from MMuLV, AMV or HIV-l and/or the 

enzyme effecting reverse transcription in the second step is 
Tth or Taq. 

14. A method according to any of the preceding claims, where- 
in the at least one cDNA primer includes an oligo or poly dT 

30 tail in the 3' end. 
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15. A method according to claim 14, wherein the at least one 
cDNA primer has the general formula 5 ' -dT n2 - V n3 -N n4 - 3 ' , where- 
in dT is deoxythymidinyl, V is A, G, or C, N is A, G, C or T , 
n2 is an integer & 1, n3 is 0 or 1, if n3 is 0 then n4 is 0, 

5 and if n3 is 1 then n4 is an integer & 0. 

16. A method according to claim 15, wherein, when n3 is 1, 
3 x 4 n4 groups of cDNA primers are used, each group being 
distinct from any one of the other groups with respect to the 
structure -V n3 -N n4 - . 

10 17. A method according to claim 16, wherein the pool of mRNA 
is subdivided into 3 x 4 n4 aliquots which are each subjected 
separately to step a) utilising one of the 3 x 4 n4 groups of 
cDNA primers, thereby obtaining a subdivision of the first 
strand cDNA into 3 x 4 n4 separate pools. 

15 18. A method according to claim 16 or 17, wherein n4 is 0 or 
1 . 

19. A method according to any of claims 1-13, wherein the at 
least one cDNA primer does not include a poly or oligo dT 
tail in the 5 '-end, or wherein at least two cDNA primers are 

2 0 used of which at least one includes a poly or oligo dT tail 
in the 5' -end and of which at least one second does not 
include a poly or oligo dT tail in the 5' -end. 

20. A method according to claim 19, wherein the cDNA primer 
which does not include a poly or oligo dT tail in the 5' -end 

25 has the following structure 

5'-N x TTA-3' or 5'-N x CTA~3' or 5'-N x TCA-3', 
wherein N is A, G, T, or C, and x is an integer 1 <; x <; 20. 



21. A method according to any of the preceding claims, where- 
in step b) is carried out under conditions which minimize the 
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formation of mismatches between nucleotides in the first and 
second cDNA strands. 

22. A method according to any of the preceding claims, where- 
in the at least one restriction enzyme is chosen so as to 

5 ensure that at least 60% of cDNA's are cleaved. 

23. A method according to any of the preceding claims compri- 
sing the use of at least one restriction endonuclease which 
upon cleavage of cDNA results in cleaved cDNA fragments 
having sticky ends. 

10 24. A method according to any of the preceding claims, where- 
in the at least one restriction enzyme is chosen so as to 
cleave each complete cDNA into an average of about 3 frag- 
ments . 



25. A method according to any of the preceding claims, com- 
15 prising the use of a rare 4 base cutter as at least one 
restriction endonuclease, such as the 4 base cutter Acil, 
Alul, Bfal, BstUI, Csp6I, Dpnl, DpnII, Haelll, Hhal, HinPlI, 
Hpall, Mbol, Mnll, Msel , Mspl, Nlalll, Rsal , Sau3AI, Tail, 
TaqI, and Tsp5 09I. 

20 26. A method according to any of the preceding claims, where- 
in one restriction enzyme is used. 

27. A method according to any of claims 1-21, which comprises 
the use of a first restriction enzyme which statistically 
cleaves at least 2 0% of complete cDNA derived from the mRNA 

25 sample into two subf ragments , and of a second restriction 
enzyme which statistically cleaves at least 50% of said 
subf ragments into 3 further subf ragments . 

28. A method according to any of the preceding claims, where- 
in, in step d) , at least one termination fragment is also 

30 ligated to the 3' -end of single strands of cleaved cDNA frag- 
ments, said at least one termination fragment introducing a 



* 
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block against DNA polymerization in the 5 ' -*3 ' direction 
setting out from the at least one termination fragment and 
said at least one termination fragment being unable to anneal 
to any primer of the at least two primer sets in step e) 
5 during the molecular amplification procedure. 

29. A method according to claim 28, wherein the at least one 
termination fragment comprises or is a chemically modified 
nucleotide sequence . 

30. A method according to claim 29, wherein the chemically 

10 modified nucleotide sequence comprises a dideoxynucleotide in 
the 3 ' -end. 

31. A method according to claim 30, wherein the dideoxy- 
nucleotide is covalently attached to the nucleotide strand. 

32. A method according to any of the preceding claims, where- 
15 in the ligation of adapter and/or termination fragments to 

the cleaved cDNA fragments in step d) is achieved by anneal- 
ing the adapter fragments to sticky ends of the cDNA result- 
ing from the cleavage in step c) and subjecting the product 
to the action of an enzyme having DNA ligase activity. 

20 33. A method according to any of the preceding claims, where- 
in the at least one set of amplification primers of formula I 
or II wherein nl is 2* 0 has nl-1, nl==2, nl=3, or nl=4. 

34. A method according to any of the preceding claims, where- 
in nl=0 in the at least one set of amplification primers 

25 having formula I or II. 

35. A method according to claim 34, wherein the set of ampli- 
fication primers having nl=0 in formula I or II is labelled. 



36. A method according to any of the preceding claims, where- 
in the set of amplification primers having formula I or II 
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wherein nl>0 comprises all possible combinations and permuta- 
tions of A, G i T and C in the group N nl . 

37. A method according to any of the preceding claims, where- 
in the ligated cDNA fragments are sub-divided into a number 

5 of pools prior to the molecular amplification in step e) , 

each pool being subjected to the amplification using a subset 
of the set of amplification primers, 

38. A method according to claim 36, wherein the subset of 
amplification primers used for each pool comprises a primer 

10 as defined in claim 35. 

39. A method according to any of the preceding claims, which 
comprises the use of one amplification primer as defined in 
claim 35, and of one set of primers as defined in claim 36. 

40. A method according to claim 39, wherein the ligated cDNA 
15 fragments of step d) are subdivided into 4 nl pools which are 

each subjected separately to step e) wherein is used one 
amplification primer as defined in claim 35 and one primer 
from the set of amplification primers as defined in claim 36, 
said one primer being distinct from any one of the primers 
20 used for amplifying any of the other pools. 

41. A method according to any of the preceding claims, which 
comprises the further step of separating amplified fragments 
obtained from the molecular amplification procedure. 

42. A method according to claim 41, wherein the separation is 
2 5 performed by gel electrophoresis or chromatography. 

43. A method according to claim 41 or 42, which further 
comprises the step of identifying separated amplified frag- 
ments . 

44. A method according to claim 43, wherein the identifica- 
30 tion is performed by visualization. 
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45. A method according to claim 44, wherein labelled nucleo- 
tides are visualized, the labelled nucleotides being part of 
a probe or of the amplified fragments. 

46. A method according to claim 45, wherein the labelled 

5 nucleotides are the labelled nucleotides being part of the 
labelled primers as defined in claim 35. 

47. A method according to claim 46, wherein the visualization 
is performed by incorporating radioactive or fluorescent 
alpha dNTP into the cDNA fragment during PCR, where N = A, C, 

10 T, U or G. 

48. A method for determining the presence of an expression 
product in a cell or group of cells, the method comprising 
providing an RNA- containing sample from the cell or group of 
cells and subjecting the sample to the method according to 

15 any of claims 1-47, and thereafter performing a comparison of 
the thus identified amplified cDNA fragments with a database 
output, said database output comprising a computer -generated 
list of molecular weights of restriction DNA fragments of 
known sequences, said list being prepared by 

20 - inputting and storing DNA sequence data in a database as 
virtual DNA sequences, 

subsequently simulating cleavage of the virtual DNA 
sequences with the at least one restriction nuclease and 
storing the resulting simulated cleavage products as 

25 virtually cleaved DNA fragments, 

simulating ligation to the virtually cleaved fragments of 
the at least two adapter fragments and storing the re- 
sults as virtually ligated DNA fragments, 
for each individual combination of primers used in step 

30 e) , grouping the virtually ligated DNA fragments suscep- 

tible to amplification by said combination of primers in 
the same group, 
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determining, in each group, the absolute and/or relative 
molecular weight of each virtually ligated DNA fragment, 
and 

outputting the content of each group in the form of a 
5 list comprising the absolute and/or relative molecular 

weights of the virtually ligated fragments in the group. 

49. A method according to claim 48, wherein the input DNA 
sequence data are linked to data relating to the genetic 
origin of the DNA sequence data and optionally to data relat- 
10 ing to functional features relating to the genetic origin. 



50. A method according to claim 48, wherein the output indi- 
cation further comprises information about the genetic origin 
of the virtually ligated DNA fragment and optionally informa- 
tion about functional features associated with the genetic 

15 origin. 

* 

51. A method according to any of claims 3 8-50, wherein the 
comparison is performed by inputting the identified amplified 
cDNA fragments in a format which allows automated comparison 
with the database output, or, alternatively, by outputting 

2 0 the database output in a format which allows for direct 

comparison between the separated amplified cDNA fragments and 
the database output. 

52. A method for determining change in expression, compared 
to the expression in a reference cell or reference group of 

25 cells, of an expression product in a cell or group of cells 
which has been subjected to a first set of conditions in- 
fluencing the expression pattern of said cell or group of 
cells, said reference cell or group of cells being subjected 
to a second set of conditions, the method comprising pro- 

30 viding an RNA- containing sample from the cell or group of 
cells and subjecting the sample to the method according to 
any of claims 43-47 thereby obtaining data describing the 
amplified cDNA fragments derived from the sample, providing 
reference data describing amplified cDNA fragments derived 
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from an RNA- containing reference sample from the reference 
cell or reference group of cells, the reference data being 
obtained by having previously subjected the reference sample 
to the method according to any of claims 43-47, 
5 subsequently performing a comparison of the data and the 
reference data to identify those cDNA fragments which are 
expressed at different levels in the two data sets, and 
thereafter using the differentially expressed cDNA fragments 
to determine which expression products are subject to a 
10 change in expression level. 

53* A method according to claim 52, wherein the data and 
reference data are selected from the group consisting of the 
apparent molecular weights of the amplified DNA fragments, 
the M r of the amplified DNA fragments, the absolute amount of 
15 the amplified DNA fragments, and the relative amounts of the 
amplified DNA fragments. 

54. A method according to claim 52, wherein the reference 
data are extracted from a database containing the reference 
data defined in claim 53 and optionally further information 

20 relating to the genetic origin of each amplified cDNA frag- 
ment from the reference. 

55. A method for diagnosing a disease in a subject, said 
disease being characterized by a deviating expression level 
of at least one expression product in at least one cell type, 

25 the method comprising providing an RNA- containing sample 
derived from the at least one cell type, subjecting the 
sample to the method according to any of claims 43-47 thereby 
obtaining data describing the amplified cDNA fragments deri- 
ved from the sample, providing reference data describing 

30 amplified cDNA fragments derived from a RNA- containing refe- 
rence sample derived from the same type of cell from a sub- 
ject not suffering from the disease, the reference data being 
obtained by having previously subjected the reference sample 
to the method according to any of claims 43-47, and 
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subsequently performing a comparison of the data and the 
reference data with respect to those cDNA fragments which are 
known to be related to the disease, and assessing whether a 
significant difference in the data and reference data exists 
5 so as to establish whether the expression level of the ex- 
pression product deviates or not. 

5G. A method according to claim 55, wherein the data and 
reference data are selected from the group consisting of the 
apparent molecular weights of the amplified DNA fragments, 
10 the M, of the amplified DNA fragments, the absolute amount of 
the amplified DNA fragments, and the relative amounts of the 
amplified DNA fragments. 

57. A method according to claim 55, wherein the reference 
data are extracted from a database containing the reference 

15 data defined in claim 56 and optionally further information 
relating to the genetic origin of each amplified cDNA frag- 
ment from the reference. 

58. A method of synthesizing first strand cDNA, the method 
comprising subjecting a sample comprising mRNA to reverse 

20 transcription wherein, in a first step performed at a tempe- 
rature not exceeding 55°C, a first enzyme is used having a 
substantial reverse transcriptase activity at said tempera- 
ture not exceeding 55°C, and, in a subsequent second step 
performed at an elevated temperature in the range of 4 5°C - 

25 95°C, a second enzyme is used having a substantial reverse 
transcriptase activity at said elevated temperature, said 
first enzyme being substantially inactive. 

59. A method according to claim 58, wherein both enzymes are 
present in both steps. 

30 60. A method according to claim 58, wherein said first enzyme 
has a substantially higher activity than said second enzyme 
in catalyzing reverse transcription at said temperature not 
exceeding 55°C. 
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61. A method according to claim 59 or 60, wherein said second 
enzyme has a substantially higher activity than said first 
enzyme in catalyzing reverse transcription at said elevated 
temperature . 

62. A method according to any of claims 58-61, wherein said 
first enzyme is selected from the group consisting of non- 
thermostable reverse transcriptases and wherein said second 
enzyme is selected from the group consisting of thermostable 
DNA polymerases with reverse transcriptase activities. 

63. A composition for use in reverse transcription of RNA, 
the composition comprising 

a) a first enzyme having reverse transcriptase activity 
at temperatures not exceeding 55°C 

b) a second enzyme having reverse transcriptase activity 
at elevated temperatures in the range of 45°C - 95°C / 

said second enzyme having a substantially higher activity 
than said first enzyme in catalyzing reverse transcription at 
said elevated temperatures. 

64. A composition according to claim 63, wherein said first 
enzyme has a substantially higher activity than said second 
enzyme in catalyzing reverse transcription at said tempera- 
tures not exceeding 55°C. 

65. Use of a thermostable enzyme having reverse transcriptase 
activity in the preparation of a composition for use in 
reverse transcription of RNA which has previously been in 
vitro reverse transcribed by another enzyme having reverse 
transcriptase activity . 

66. A method of preparing a surface (chip) coated with cDNA 
fragments, the method comprising 
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subjecting an RNA-containing sample to the method of any 
of claims 41-47, and 

transferring the separated amplified cDNA fragments to a 
chip surface adapted to stably bind the separated ampli- 
5 fied cDNA fragments while maintaining the spatial relati- 

ve distribution pattern thereof. 

67. A method of preparing a surface (chip) coated with cDNA 
fragments, the method comprising 

subjecting an RNA-cont aining sample to the method of any 

10 of claims 1-40, 

separating, by electrophoresis, the thus obtained ampli- 
fied cDNA fragments on a particular surface adapted to 
stably bind the separated amplified cDNA fragments while 
maintaining the relative distribution pattern after 

15 electrophoresis . 

68. A method according to claim 67, wherein the electrophore- 
sis is in the form of microelectrophoresis. 

69. A method according to any of claims 66-68, wherein the 
transfer is accomplished by a electrophoret ic blotting tech- 

20 nique. 

70. A method according to any of claims 66-68, wherein the 
transfer is accomplished by photo-activated organic or inor- 
ganic chemistry techniques. 

71. A surface having cDNA stably bound thereto, said surface 
25 being obtainable by the method according to any of claims 66- 

70. 

72. A method for the screening for genes within a family of 
genes, the method comprising providing a surface according to 
claim 71, wherein cDNA stably bound to the surface is hybri- 

30 dized under low stringency conditions to a detectably label- 



RECTIFIED SHEET (RULE 91) 
ISA/EP 



WO 98/51789 



56 



PCT/DK98/00186 



led nucleic acid which is a representative of a gene family, 
and subsequently analyzing fragments of the chip to which 
hybridization has occurred so as to determine whether such 
fragments are related to the same gene family. 

73. A method for determining the difference in expression 
pattern between a first cell or type of cells and a second 
cell or type of cells, the method comprising providing sam- 
ples of labelled RNA or cDNA from the first and second cells 
or cell types and subsequently contacting each of these 
samples with a surface according to claim 71, and subsequent- 
ly detecting the amount and distribution of bound labelled 
RNA or cDNA from each sample. 

74. A method for screening for interactions between a pre- 
selected protein and a polypeptide fragment, Lhe method 
comprising preparing a sub-divided library of amplified cDNA 
fragments according to the method of any of claims 1-47, 
optionally adapting the terminals of the members of the 
library so as to facilitate insertion into a vector, insert- 
ing the fragments into vectors, transforming a population of 
suitable host cells with the vectors, culturing the host 
cells under conditions which enable expression of correctly 
inserted cDNA fragments by the host cell, and subsequently 
assaying polypeptide fragments encoded by the inserted cDNA 
fragments for interaction with the pre-selected protein. 

75. A method according to claim 74, wherein assaying of the 
polypeptide fragments is performed by a two-hybrid technique, 
wherein the host cells are eukaryotic cells which are mated 
or transfected with nucleic acid material encoding the pre- 
selected protein, successful mating/transf ection of the 
cell(s) resulting in a cell or cells wherein the interaction 
between the pre-selected protein and a polypeptide fragment 
gives rise to a detectable signal. 

76. A method according to claim 75, wherein a fungus, such as 
a yeast cell, is used in the mat ing/ trans f ection . 
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77. A method according to claim 75 or 76, wherein the delect- 
ihle signal is provided by Green Fluorescent Protein. 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-57,66-77 

A method for preparing a normalized sub-divided library of 
amplified cONA fragments from the coding region of mRNA 
contained in in a sample, the method comprising the steps of 
a) subjecting the mRNA derived from the sample to reverse 
transcription using at least one cDNA primer, b) 
synthesizing second strand cDNA complementary to the first 
strand cDNA fragments by use of the first strand DNA 
fragmnets as templates, and a second primer and c) 
subjecting the cDNA fragments obtained in step b) to a 
molecular amplification procedure so as to obtain amplified 
cDNA fragments, wherein is used a set of amplification 
primers; a method for preparing a normalized sub-divided 
library of amplified cDNA fragments from the coding region 
of mRNA contained in in a sample, the method comprising the 
steps of a) subjecting the mRNA derived from the sample to 
reverse transcription using at least one cDNA primer, b) 
synthesizing second strand cDNA complementary to the first 
strand cDNA fragments by use of the first strand DNA 
fragmnets as templates, thereby obtaining double stranded 
cDNA fragments, c) digesting the double stranded cDNA 
fragments with at least one restriction endonuclease, 
thereby obtaining cleaved cDNA fragments, d) ligating at 
least two adapter fragments to the cleaved cDNA fragments 
obtained in step c), so as to obtain ligated cDNA fragments, 
and e) subjecting the ligated cDNA fragments in step d) to a 
molecular amplification procedure so as to obtain amplified 
cDNA fragments, wherein is used, for an adapter fragment 
used in step d) , a set of amplification primers; a method 
for determining the presence of an expression product in a 
cell or a group of cells using said method; a method for 
determining change in expression, compared to the expression 
in a reference cell or reference group of cells; a method of 
diagnosing a disease in a subject; a method of preparing a 
surface (chip) coated with cDNA fragments; a method for 
screening for genes within a family of genes; a method for 
screening for interactions between a pre-selected protein 
and a polypeptide fragment; said method whrein assaying of 
the polypeptide fragments is performed by a two-hybrid 
technique; 



2. Claims: 58-65 

A method of synthesizing first strand cDNA, the method 
comprising subjecting a sample comprising mRNA to reverse 
transcription wherein, in a first step performed at a 
temperature not exceeding 55°C, a first enzyme is used 
having a substantial reverse transcriptase activity at a 
temperature not exceeding 55°C, and, in a subsequent second 
step performed at an elevated temperature in the range of 
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45°C-95°C, a second enzyme is used having a substantial 
reverse transcriptase activity at said elevated temperature, 
said first enzyme being substantialy inactive; a composition 
for use in reverse transcription of RNA, the composition 
comprising a) a first enzyme having reverse transcriptase 
activity at temperatures not exceeding 55°C b) a second 
enzyme having reverse transcriptase activity at elevated 
temperatures in the range of 45°C - 95°C, said first enzyme 
has substantially higher activity than said second enzyme in 
catalyzing reverse transcription at said temperature not 
exceeding 55°C; use of thermostable enzyme having reverse 
transcriptase activity in the preparation of a composition 
for use in reverse transcription of RNA which has previously 
been in vitro reverse transcribed by another enzyme having 
reverse transcriptase activity; 
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